A Modified UCT Algorithm Basd on Risk Estimation Methods

نویسندگان

  • Jiajia Zhang
  • Xuan Wang
چکیده

Risk dominance and payoff dominance strategy are two complementary parts of the game theory decision strategy. While payoff dominance is still the basic principle in perfect information, two player games, risk dominance has shown its advantages in imperfect information conditions. In this paper, we first review the related work in the area of estimation methods and the influence of risk factors on computing game equilibrium. Then a new algorithm, UCT-Risk is proposed in this paper, which is a modification of UCT (UCB apply to Trees) algorithm based on risk estimation methods. Finally, we implement the proposed algorithm in SiGuo game, a popular imperfect information game in China. The experimental result of the new algorithm shows it correctness and effectiveness.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Modified UCT Algorithm Basing on Risk Estimation Methods in Imperfect Information Games

Risk dominance and payoff dominance strategy are two complementary parts of the game theory decision strategy. While payoff dominance is still the basic principle in perfect information, two player games, risk dominance has shown its advantages in imperfect information conditions. In this paper, we first review the related work in the area of estimation methods and the influence of risk factors...

متن کامل

Automatic Bounding Estimation in Modified Nlms Algorithm

Modified Normalized Least Mean Square (MNLMS) algorithm, which is a sign form of NLMS based on set-membership (SM) theory in the class of optimal bounding ellipsoid (OBE) algorithms, requires a priori knowledge of error bounds that is unknown in most applications. In a special but popular case of measurement noise, a simple algorithm has been proposed. With some simulation examples the performa...

متن کامل

Identification of outliers types in multivariate time series using genetic algorithm

Multivariate time series data, often, modeled using vector autoregressive moving average (VARMA) model. But presence of outliers can violates the stationary assumption and may lead to wrong modeling, biased estimation of parameters and inaccurate prediction. Thus, detection of these points and how to deal properly with them, especially in relation to modeling and parameter estimation of VARMA m...

متن کامل

A New Approach to Software Cost Estimation by Improving Genetic Algorithm with Bat Algorithm

Because of the low accuracy of estimation and uncertainty of the techniques used in the past to Software Cost Estimation (SCE), software producers face a high risk in practice with regards to software projects and they often fail in such projects. Thus, SCE as a complex issue in software engineering requires new solutions, and researchers make an effort to make use of Meta-heuristic algorithms ...

متن کامل

Generalized Rapid Action Value Estimation

Monte Carlo Tree Search (MCTS) is the state of the art algorithm for many games including the game of Go and General Game Playing (GGP). The standard algorithm for MCTS is Upper Confidence bounds applied to Trees (UCT). For games such as Go a big improvement over UCT is the Rapid Action Value Estimation (RAVE) heuristic. We propose to generalize the RAVE heuristic so as to have more accurate es...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014